# 16kHz Audio Adaptation
Whisper Medium Vaani Telugu
MIT
A Telugu automatic speech recognition model based on OpenAI Whisper-small architecture, optimized for Indian languages by the ARTPARK-IISc team
Speech Recognition Other
W
ARTPARK-IISc
26
1
Hubert Large Superb Ks
Apache-2.0
Keyword detection model based on Hubert-Large architecture, excelling in SUPERB benchmark tests
Speech Recognition
Transformers English

H
superb
78
0
Viwav2vec2 Base 100h
Apache-2.0
A base Wav2Vec2 model pretrained on 100 hours of unlabeled Vietnamese speech audio from the VLSP dataset, requiring fine-tuning for downstream tasks.
Speech Recognition
Transformers Other

V
dragonSwing
19
0
Wav2vec2 Large Xlsr Bengali
A Bengali automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained using OpenSLR dataset.
Speech Recognition
Transformers

W
tanmoyio
24.32k
3
Sew Tiny 100k
Apache-2.0
SEW-tiny is a compressed and efficient speech pretraining model developed by ASAPP Research, pretrained on 16kHz sampled speech audio, suitable for various downstream speech tasks.
Speech Recognition
Transformers Supports Multiple Languages

S
asapp
1,080
3
Wav2vec2 Large Xlsr Hindi Marathi
Apache-2.0
Fine-tuned based on Facebook's wav2vec2-large-xlsr-53 model, supporting automatic speech recognition tasks for Hindi and Marathi
Speech Recognition
Transformers Other

W
tanmaylaud
76
0
Unispeech 1350 En 17h Ky Ft 1h
A speech recognition model based on Microsoft's UniSpeech architecture, specifically fine-tuned for the Kyrgyz language
Speech Recognition
Transformers Other

U
microsoft
39
1
Sew D Base Plus 400k Ft Ls100h
Apache-2.0
SEW-D-base+ is an efficient speech recognition model developed by ASAPP Research, pre-trained on 16kHz sampled speech audio, and excels on the LibriSpeech dataset.
Speech Recognition
Transformers English

S
asapp
66
4
Featured Recommended AI Models